An improved Fuzzy Clustering methodology applied to the study of Protein Conformational Ensembles
نویسندگان
چکیده
Clustering is a technique that aims to group data objects. Various similarity measures such as Euclidean, city-block, Mahalanobis distances and coisine similarity [1] have been used for discovering the underlying structures in data. Formally, the problem of clustering may be described as follows: Given a set of data objects X = {x1, x2, . . . , xn}, a clustering algorithm determines a suitable number k of homogeneous groups, and maps the data points to the labels in the set C = {1, 2, . . . , k}, where each label identifies a homogeneous group of objects. A good clustering algorithm should have the following characteristics: it should be scalable, i.e., perform well on data sets having large number of objects and also large number of attributes, should be able to determine clusters of varying shape and size, should have least requirement of domain knowledge (e.g., number of clusters, thresholds, termination condition parameters), should work well in the presence of noise and outliers, and should be insensitive to order of objects [2]. We present in this work an improved fuzzy clustering algorithm [3] that fulfills those criteria and we show its application for the classification of protein conformational ensembles, problem that arises in many domains of structural bioinformatics.
منابع مشابه
Water Quality Zoning of Rivers by the Technique of Fuzzy Clustering Analysis
Zoning the pollution of a river may be the first or even the most important step in water quality management. In order to resolve its pollution, fuzzy clustering analysis may be used whenever a composite classification of water quality incorporates mutiple parameters
 
In such cases, the technique may be used as a complement or an alternative to comprehensive assessment. In fuzzy cluster...
متن کاملWater Quality Zoning of Rivers by the Technique of Fuzzy Clustering Analysis
Zoning the pollution of a river may be the first or even the most important step in water quality management. In order to resolve its pollution, fuzzy clustering analysis may be used whenever a composite classification of water quality incorporates mutiple parameters In such cases, the technique may be used as a complement or an alternative to comprehensive assessment. In fuzzy clustering ...
متن کاملOPTIMIZATION OF FUZZY CLUSTERING CRITERIA BY A HYBRID PSO AND FUZZY C-MEANS CLUSTERING ALGORITHM
This paper presents an efficient hybrid method, namely fuzzy particleswarm optimization (FPSO) and fuzzy c-means (FCM) algorithms, to solve the fuzzyclustering problem, especially for large sizes. When the problem becomes large, theFCM algorithm may result in uneven distribution of data, making it difficult to findan optimal solution in reasonable amount of time. The PSO algorithm does find ago...
متن کاملNew Approach for Customer Clustering by Integrating the LRFM Model and Fuzzy Inference System
This study aimed at providing a systematic method to analyze the characteristics of customers’ purchasing behavior in order to improve the performance of customer relationship management system. For this purpose, the improved model of LRFM (including Length, Recency, Frequency, and Monetary indices) was utilized which is now a more common model than the basic RFM model apt for analyzing the cus...
متن کاملA Multi-Objective Approach to Fuzzy Clustering using ITLBO Algorithm
Data clustering is one of the most important areas of research in data mining and knowledge discovery. Recent research in this area has shown that the best clustering results can be achieved using multi-objective methods. In other words, assuming more than one criterion as objective functions for clustering data can measurably increase the quality of clustering. In this study, a model with two ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014